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Summary. The large surveys and sensitive instruments of modern astronomy are 
turning ever more examples of variable objects, many of which are extending the 
parameter space to testing theories of stellar evolution and accretion. Future projects 
such as the Laser Interferometer Space Antenna (LISA) and the Large Synoptic 
Survey Telescope (LSST) will only add more challenging candidates to this list. 
Understanding such objects often requires fast spectroscopy, but the trend for ever 
larger detectors makes this difficult. In this contribution I outline the science made 
possible by high-speed spectroscopy, and consider how a combination of the well- 
known progress in computer technology combined with recent advances in CCD 
detectors may finally enable it to become a standard tool of astrophysics. 



1 Introduction 

High-speed photometry has a long history in optical astronomy. The late 1960s 
and early 1970s saw the combination of digital recording techniques and pho- 
tomultiplier tubes to give photon counting high-speed photometry, e.g. [30j . 
Equivalent high-speed spectroscopy has never been as straightforward and is 
still not a standard technique. There are several reasons for this. First, it is 
technically more difficult and expensive to record spectra at high-speed as 
one needs fast 2D imagers rather than single pixel devices. An early example, 
which was developed for faint object spectroscopy but could also take fast 
spectra, was the Image Photon Counting System (IPCS) [2 . The IPCS illus- 
trates a problem typical of high-speed spectroscopy, because although it could 
take spectra with exposure times well below 1 second, it was rarely used to 
do so because its design limited its maximum count rate in any given pixel 
to less than 1 photon/second. A second difficulty of high-speed spectroscopy 
in the era of the IPCS was simply that it could produce more data than the 
computer technology of the 1970s and early 1980s could easily deal with; the 
author of this chapter recalls an observing run on the Anglo- Australian tele- 
scope which produced enough magnetic tape to run to and from the local 
town Coonabarabran 40 km away twice over. Computer technology has since 
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taken enormous strides of course, but the 1980s brought the biggest obstacle 
of all for high-speed spectroscopy, namely Charge-Coupled Devices (CCDs). 
Although wonderful detectors in many respects, the CCDs employed in most 
observatories have two significant disadvantages: they are slow, often taking 
several tens of seconds to read out, and significant noise is added to each 
pixel, burying the small signals characteristic of high-speed spectroscopy. The 
slow readouts of standard CCDs has lead to a small revival of the old photo- 
graphic method of "trailed spectra" where the target is moved along the slit 
to give time- resolved spectra [131 [3S], but the disadvantages of this method, 
such as seeing-dependent time-resolution, mean that it is really only a stop 
gap measure and I will not consider it further. 

Despite the problems of CCDs, their other excellent characteristics, in 
particular their high quantum efficiency (QE), have lead to their complete 
dominance of optical astronomy, and so the focus of this chapter will be tilted 
towards the use of CCDs for high-speed spectroscopy. Moreover, CCDs are 
so dominant that they provide us with an empirical definition of what "high- 
speed" means when applied to spectroscopy, i.e. any spectroscopic observation 
that is difficult to carry out with normal CCDs is "high-speed" . This does not 
always mean very fast: readout noise can make echelle spectroscopy of an 18"^ 
magnitude object difficult to carry out with a time resolution shorter than 10 
minutes. 

With the above broad definition in mind, an overview is given of the sci- 
ence that high-speed spectroscopy makes possible followed by the technical 
requirements and the possible application of electron-multiplying CCDs to 
this area. 



2 Scientific Motivation 

As the introduction hinted, for a variety of reasons, high-speed spectroscopy 
has yet to take off fully and many applications exist only in the imagination. 
There are nonetheless a reasonable number of published examples which give 
an idea of what to expect, and in addition we have the many applications of 
high-speed photometry to draw on as a resource when considering what can be 
learned from high-speed spectroscopy. In this section I will look at what high- 
speed spectra can tell us when applied to the following phenomena, sticking 
as far as possible to what is already known: 

• pulsating white dwarfs and subdwarf B (sdB) stars 

• accreting binary stars 

• white dwarf binary stars 

Before doing so, it is worth mentioning objects yet to be discovered because 
surveys are being planned to look for short-timescale variable phenomena 
which are bound to discover many objects that will need high-speed spec- 
troscopy. Pre-eminent perhaps, although still far-off, is the Large Synoptic 
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Survey Telescope {LSST), in which it is planned to use an 8.4 m telescope to 
survey the observable sky once every 3 nights in 15-second exposures. 

2.1 Pulsating white dwarfs and sdB stars 

As well demonstrated by hehoseismology, pulsations in stars can give us unique 
insights into stellar structure. The area of asteroseismology tries to use pulsa- 
tions in other stars in this manner. It is observationally demanding first and 
foremost because of the need for long, uninterrupted time series in order to 
obtain clean time series. In the case of white dwarfs and sdB stars, which 
have pulsation periods of order 100 to 1000 seconds, fairly fast observations 
are also a requirement. If all goes well, it is possible to pin down stellar pa- 
rameters with great precision. For instance studies of the pulsating sdB stars 
PG0014-I-067, PG1219+534 and Feige 48 have measured their surface gravi- 
ties to ~ 1%, their masses to ~ 2% and also determined the masses of their 
hydrogen envelopes, for which no other method exists, to ^ 10% [H [HI [Z]- 
Despite this, even large datasets can leave stars unsolved, especially if they 
display few pulsation modes, the difficulty being the secure identification of a 
particular eigenmode with a given frequency. 

It is unlikely that spectroscopic data can ever be taken with the same time 
coverage and uniformity as the photometric data. However spectra can add 
extra information missing from the photometric studies because the variation 
of limb darkening with wavelength in combination with the different patterns 
of different eigenmodes can lead to different variations of pulsation amplitude 
with wavelength |34j • Figure [1] shows predictions [9 of the amplitudes of 
different eigenmodes for the ZZ Ceti star G29-38. This shows that spectra may 
allow I = 3 and / = 4 eigenmodes to be distinguished from / = 1 and I — 2 
eigenmodes. A signal-to-noise ratio of 1000 only gives a signal-to-noise of 10 
in the amplitude spectrum when the amplitude itself is of order one percent as 
it is in ZZ Ceti stars, and therefore extremely high quality data are needed to 
separate / — 1 from 1 = 2. Thus this application requires moderately fast data 
acquisition to resolve the pulsations, but also a large aperture to give high 
signal-to-noise. Figure [2] shows the result of 4 hours of Keck II/LRIS time 
devoted to the brightest ZZ Ceti star, G29-38 {V = 13) [HI [9]. The mean 
spectrum of these data appears almost noiseless [9] , as might be expected for 
a bright star on a large telescope, but comparing Figs. [T] and [2| shows how 
necessary the large aperture is in this case. 

This case illustrates another important feature of high-speed spectroscopy: 
it does not have to be especially "high-speed" by photometric standards to 
strain the capabilities of standard instrumentation: the data were taken as 
700 X 12 second exposures, but for each exposure there were a further 12 sec- 
onds "deadtime" for readout, etc., so the observations were only 50% efficient. 
This is quite common; indeed 12 seconds is commendably short in this respect. 
Reducing deadtime is probably the single most effective way of enabling high- 
speed spectroscopy and would save countless hours of precious telescope time. 
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Fig. 1. The pulsation amphtudes versus wavelength for spherical harmonics Yim 
with different /-values. Each curve is normalised by its value at 5500A. The figure 
is Fig. 3 taken from [S] 
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Fig. 2. Spectra of the first pulsation amplitudes of the ZZ Ceti star G29-38 observed 
on Keck-II/LRlS by [41]. The figure is adapted from Fig. 4 of [9] 
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The end result of the work shown in Figs. [T] and [5] is that most of the modes 
are / — 1 modes. This shows the power of spectroscopy because in principle 
this method can be applied to stars showing just a few modes where it would 
certainly not be possible to unravel the modes purely photometrically. 

2.2 Accreting binary stars 

I now move on to systems which display variability on much shorter timescales 
than the white dwarf and sdB pulsators. The shortest dynamical timescales 
in accreting binaries with compact objects range from 1 to 10 seconds in the 
case of accreting white dwarfs down to about a millisecond in the case of the 
neutron stars and stellar mass black holes. Variability on seconds timescales is 
well established from the "dwarf nova oscillations" displayed by dwarf novae 
during their outbursts. To my knowledge, dwarf nova oscillations have only 
been observed spectroscopically with sufficient speed to detect them once [38] , 
but this single observation was enough to show the great diagnostic potential 
spectra may hold for these poorly-understood but remarkable phenomena. 

The variability timescales seen in black-hole and neutron star systems can 
depend upon the system brightness and instrumental limitations as much as 
the object. X-ray variability at kilohertz frequencies has been seen in X-ray 
binaries, while optical variability on timescales well below 0.1 seconds has been 
seen in bright systems [28j. Using ULTRACAM on the Very Large Telescope 
(VLT) we found significant flaring on timescales of a few seconds in a faint 
quiescent black-hole accretor (Shahbaz et al, in prep). One can only speculate 
upon the spectroscopic signatures of this variability at optical wavelengths 
as no such observations have been made, but it is very likely that there will 
be some, although in the quiescent black-hole case only an "Extremely Large 
Telescope" (ELT) would be up to the task. A case of particular interest is the 
fluorescent Bowen blend emission from the donor stars seen in some X-ray 
binaries and which in some cases has revealed the donor star for the very first 
time |36j . This emission is presumably X-ray driven, and can be expected to 
respond to X-ray variability; some evidence of this has been obtained using 
narrow-band photometry with ULTRACAM , but a much cleaner signature 
could come from high-speed spectra. 

A dramatic example of spectral variability from an accreting binary is 
shown in Fig. \3\ This shows the eclipse phases of the white dwarf accretor 
IP Peg in one of its temporary high states (outburst) [TH]. The lines from 
this system come primarily from the accretion disc and have the well-known 
double-peaked profiles that come from Doppler shifting in accretion discs [17] . 
The spectra plotted cover about 30 minutes and show very large changes as 
first the approaching side of the disc is obscured, affecting the blue-shifted 
parts of the lines, followed soon after by their red-shifted counterparts. The 
right-most panel of Fig. [3] shows a simple model which captures much of the 
phenomenology of the data while at the same time showing significant dis- 
crepancies. The nature of these is not understood although it is possible that 
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Fig. 3. The first and second panels show two of the emission lines of the dwarf nova 
IP Peg as they are eclipsed, with orbital phase increasing up the plot. The spectra 
were taken on Subaru during an outburst of IP Peg; the lines are dominated by light 
from the bright accretion disc during this state. The second panel shows a simple 
computation of the eclipse of a disc in prograde, Keplerian rotation. Figure adapted 
from Fig. 2 of US] 

the disc during outburst is far from Keplerian in nature. This example is an 
even starker indication of the problems often faced with facility instruments: 
IP Peg reaches V — 12 m outburst, yet despite using the 8 m Subaru, the 
time resolution here was a sluggish 80 seconds, each spectrum consisting of 30 
seconds exposure followed by 50 seconds of deadtime. The telescope, instru- 
ment and object would have allowed much higher speed than this. Ideally one 
would want to sample fast enough to resolve structure comparable in size to 
the white dwarf, which would be about 5 seconds in this case. Unfortunately 
the detector/data acquisition system was not up to the job, a not-uncommon 
situation, as instruments are rarely built with high-speed applications in mind. 

I finish off this section with a look at Doppler imaging as applied to ac- 
creting binary stars in the form of Doppler tomography [211 [22] ■ Doppler 
tomography uses the information in line profiles as a function of phase to 
image binaries. Notable successes have been the discovery of spiral structure 
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in accretion discs during outburst |371 114] and the unravelling of the complex 
accretion structures in the magnetic polar class of accreting white dwarfs [3S] . 
Doppler tomography provides a quantitative illustration of the need for high- 
speed spectroscopy and is useful in defining requirements for such work. This 
is because the resolution in Doppler tomography is limited by both spectral 
and time resolution. If one is aiming for a spatial resolution Ax, then this 
imposes the following restrictions on the spectral resolution R\ = X/AX and 
the time resolution At: 

i?A ~ {c/n){Axr\ (1) 

At - V'^Ax, (2) 

where the orbital angular frequency i? 27r/P where P is the orbital period, 
and V is the velocity of the feature being imaged. Small Ax requires large 
Rx and small At. These relations can be combined into a single one relating 
spectral and time resolution: 

Atr^Q-'^R-\ (3) 

Taking typical values P = 1.5 hours and V = 700kms~^, and assuming that 
we are working on a spectrograph with R\ = 10,000, then we require At < 
30 seconds in order that smearing during the exposures does not degrade the 
resolution. Equations [T] and [2] imply that the number of counts per detector 
resolution element per exposure scales as (Ax)"^ , and thus Doppler tomography 
can become challenging even on quite bright objects and large telescopes. 

2.3 White dwarf binary stars 

For my final example of applications of high-speed spectroscopy I turn to bi- 
nary stars with white dwarf components. This is another case where there are 
only a few examples, but where one can point to future applications which 
will prove tough for even the largest telescopes. Work over the past decade 
has established the existence of a huge population of detached, close double 
white dwarfs, with orbital periods of a few days or less [23, 29 . The shortest 
period of these is WD0957-666 with F = 88 minutes (Fig.H]) [3l|26l|2I]. There 
are estimated to be of order 100 million such systems in our galaxy, and they 
should be steadily spiralling towards shorter periods under the action of gravi- 
tational wave radiation. The existence of systems such as WD0957-666, which 
will merge within 200 million years, proves that there must be a population 
of much shorter period systems that we have yet to detect. This was realised 
some time ago and it is thought that this population, along with their accret- 
ing counterparts, the AM CVn stars, will be the dominant gravitational wave 
emission sources at frequencies of order 10"'^ Hz, right in the waveband of the 
Laser Interferometer Space Antenna (LISA) [T5l[3Tl|32] which unlike ground- 
based detectors is sensitive to gravitational waves of relatively long period . 
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Fig. 4. Trailed spectra around Ha of tiie sliortest period double white dwarf, 
WD0957-666, based upon the data of In the left panel, the brightest of the 
two white dwarfs is clearly seen; the right-hand panel shows the result of subtract- 
ing a fit to the brightest star which reveals its companion. The two stars will merge 
under gravitational radiation in about 200 million years 



LISA is predicted to be able to detect several thousand of these sources and 
will be able to narrow down the location of many of them to less than one 
degree [101 EH j and it should be possible to optically identify some of them. 
Optical follow-up can help in the determination of parameters from the LISA 
data, but will not be easy: the majority of sources will be fainter than V = 22, 
and yet have periods of order 10 minutes. Obtaining spectroscopy of these will 
clearly require very low noise detectors, even on the largest telescopes, includ- 
ing ELTs. For instance, when observing the 88 minute period double white 
dwarf WD0957-666 ^,on the 3.9m Anglo- Austrahan Telescope, our expo- 
sure times of 500 seconds were a compromise between smearing the spectra 
over too much of the orbit versus suffering too much readout noise, and this 
was on a relatively bright target with V = 14.6. This is an explicit example 
of the constraints discussed for Doppler tomography in section [2?2l 

A similarly challenging application for high-speed spectroscopy emerges 
from the eclipses of white dwarfs in detached white dwarf/main-sequence bi- 
naries. These are relatively easily studied photometrically [5], but the extra 
light losses and dispersion of spectroscopy make them a much tougher subject 
for spectroscopy and I am not aware of any spectroscopic studies which target 
the eclipses in these objects. There is nevertheless a compelling reason to do 
so which is that spectroscopy of the white dwarf as it goes in and out of eclipse 
has the potential to determine its rotation rate by measurement of the radial 
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velocity shifts induced by the eclipse. The rotation rate of white dwarfs in 
such systems has implications for the stability of double white dwarf binary 
stars [3S]. This requires taking spectra every 5 seconds or so (preferably less) 
and once again, for typical systems, telescopes and instruments, moves us into 
the realm of readout noise. 

This is an appropriate point to change topic and look at the technical 
difficulties of high-speed CCD spectroscopy. 

3 CCD spectroscopy 
3.1 Standard CCDs 

There are two key issues which make the use of standard CCDs difficult for 
high-speed work. First of all CCDs are usually slow to read out. It is not 
unusual for CCDs to be read out at ~ 100 kHz pixel rates, so that an entire 
chip can take several tens of seconds to be read out. Many chips can be read 
out faster, but then one suffers worse read noise, which can more than offset 
any advantage of high-speed. The amount of telescope time spent reading out 
CCDs is potentially frightening: consider a telescope which spends 8 hours per 
night, 365 night /year devoted to taking spectra of 8 one hour-long spectra 
per night together with calibration arcs before and after each one. If each 
spectrum takes 1 minute to read out, then 18 solid nights would be spent 
reading out the CCDs. Worse still, any programme that required exposure 
times shorter than 60 seconds would run at below 50% efficiency. As some 
of the examples of Sect. [2] showed, this is not as rare as one might imagine. 
Luckily there are ways around slow readouts. First one can group pixels (bin) 
and read out sub-sections of the chip (window). These are both often very 
effective. More radical is to use a frame transfer CCD allowing one half of the 
CCD to be read out while the other is being exposed. The EEV 47-20 CCDs 
used by ULTRACAM (Dhillon, this volume) allow most observations to be 
carried out with a deadtime of only 24 milliseconds using this technique. This 
would be more than adequate for the vast majority of feasible spectroscopic 
applications. 

This brings us to the more fundamental issue of readout noise. This plays a 
much more important role in optical spectroscopy than it does in photometry. 
The key quantity to have in mind is the variance on a given pixel which is 
given by 

V ^R^ + C, (4) 

where R is the RMS readout noise in electrons (typically ^ 3) and C is the 
number of electrons in the pixel, and is equal to the number of photons de- 
tected (which in general is the sum of target flux, sky background and dark 
counts, although the latter can usually be neglected). This is the simplest 
possible version of this relation and assumes perfect flat-fielding, but this is 
more often than not a reasonable approximation in the case of high-speed 
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work. Once C drops below ~ 9, one is starting to lose out significantly to 
readout noise. There are techniques to alleviate this: binning again is impor- 
tant. If one is observing a point source, then it makes no sense to over-resolve 
the spatial profile, and in fact under-sampling of the spatial profile is not al- 
ways much a drawback except for spotting cosmic rays, therefore binning in 
the spatial direction is often very useful. I have found that people often do 
not realise how significant an improvement binning spatially can make, but it 
not hard to demonstrate. Consider a case where two pixels with a total count 
C = are binned into one, and assume that C is dominated by the object 
(as opposed to sky background). Then the ratio of signal-to-noises, binned to 
unbinned, is (3/2)^/^, equivalent to a 50% increase of exposure time. Apart 
from this, the only other option may be to increase the exposure time: con- 
sider again the marginal case with C — when one decides to double the 
exposure time. Then it is straightforward to show that the improvement in 
signal-to-noise corresponds to the improvement that would be obtained with 
an 8/3 ~ 2. 7- fold rather than simply two- fold increase in exposure time in the 
zero readout (Poisson limited) case. Of course increasing the exposure time 
may not be an option; after all, one is trying to resolve intrinsic variability 
in a target. The only other capability one then has to affect the final signal- 
to-noise is not to make things worse by poor reduction; it is well worth using 
extraction techniques designed to optimise the signal-to-noise ratio [16l |2T] if 
at all possible. 

When is one readout noise limited? 

Despite the various ways in which one can combat readout noise, there is in 
the end no getting around it apart from changing the detector design which I 
look at next. Before doing so I pause to consider the parameter space where 
readout noise is important. Consider a telescope of diameter Z), feeding a 
spectrograph of resolution R\ leading to a detector with Nd pixels per res- 
olution element in the dispersion direction and pixels across the spatial 
profile (loose definitions, but it is the orders of magnitude that matter here). 
If e is the total throughput of atmosphere, instrument and detector in terms 
of photons detected versus those actually incident upon the atmosphere, then 
an exposure of t seconds of a target of AB magnitude m will produce C counts 
given by 



Assuming Nd = Ns = 3, Rx = 2000, D = 8m, e = 0.2, then to have C > i?^ 
for R = 3 electrons implies the following relation between m and t: 



One is therefore readout noise limited for a target of 20*'^ magnitude target 
and 30 second exposures on a typical spectrograph on (currently) the world's 
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largest telescopes. Moving to an echelle with, say, i?> = 60,000 and e = 0.1, 
then this limit must be shifted up by 4.5 mags. A simulation of the effect of 
readout noise is shown in Fig. O for a star of considerable current interest, 



Readout = 3 e Readout = e 




Fig. 5. A simulation of the trailed spectrum of the V = 21 ultra-compact binary 
star, RX J0806-I-1527, as taken with 1 night using the VLT and the F0RS2 spec- 
trograph (600 grism) with (left) and without (right) readout noise. Time advances 
upwards; the a;-range is centred on a single emission line. The exposure time was 
taken to be 30 seconds in order to resolve the presumed 321 second orbital period 
of the object, but detector readout time was assumed to be zero in each case. The 
emission line was taken to be about as strong as seen in the average spectrum of 

m 

RX J0806-I-1527. This is potentially the shortest period binary known, and 
a strong gravitational wave source, consisting of two white dwarfs with an 
orbital period of just 321 seconds [33l[T9]. This has yet to be proven however, 
and spectroscopy is by far the most promising avenue for testing it. It is 
however, as Fig. [5] shows, pushing the capabilities of even the VLT. The fact 
that as of mid-2006 this observation had yet to be performed is testament to 
its difficulty. Many would consider 1/ = 21 to be "bright", or at least, not 
especially faint, but it certainly is when one is forced to take short exposures. 
Observations of this sort are exactly those needed for the gravitational wave 
sources. Sect. 12.31 except they will in general be more difficult still. 

Having seen the limitations imposed by readout noise, I now turn to how 
it can potentially be avoided. 

3.2 Electron-multiplying CCDs 

The readout noise that so seriously limits normal CCDs is added by the ampli- 
fier which converts the small charge on each pixel into a voltage. To some ex- 
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tent readout noise can be reduced by taking longer over the double-correlated 
sampling used in low-noise CCD readouts (i.e. spending longer integrating the 
voltage levels before and after the charge on each pixel is cleared), but 1/f 
noise limits the extent to which one can push this and one cannot in any case 
take too long reading each pixel if one is interested in high speed. Electron- 
multiplying CCDs (hereafter EMCCDs, but also known as "low light level" 
or L3CCDs) get round this limitation in a clever way. In these CCDs, an 
extra series of stages is added to the serial register before the charge reaches 
the amplifier. These stages can be clocked with higher than normal voltages 
which creates a significant probability that one electron will generate another 
as the charges are moved from stage to stage. With enough stages, a single 
detected electron can lead to an avalanche of several hundred or even thou- 
sands of electrons. These then dwarf the readout noise added by the amplifier. 
The formulae for signal-to-noise from these devices are more complex than for 
standard CCDs [T], but it is necessary to review them here in order to under- 
stand the advantages and limitations of these devices. 

To first order, the avalanche gain register can be modelled as a series of 
stages at each of which there is a probability p that any electron will spawn 
another. If there are N such stages in total then the mean gain g, can be 



The value of p, and therefore g is a parameter that can be controlled by adjust- 
ment of the driving voltages. For example if we take N — 536 (appropriate to 
the CCD97 manufactured by the company, e2v) and p — 0.011, then g — 352. 
This gain is itself variable however (after it all it can in principle lie anywhere 
from 1 to 2^ ^ g although the extremes are unlikely) which increases the 
variance of the output over that expected from pure Poisson noise. Thus in 
a normal CCD readout, a mean detection rate of C electrons leads to the 
variance contribution of C in Q, while in an EMCCD the signal is amplified 
to gC on average and the corresponding variance is (g^ + (j'^)C where is 
the variance of the gain and is given by 



Had there been no dispersion in the gain then the variance would have been 
g^C, exactly what one would expect if simply multiplying C by a constant; 
the variance in the gain thus adss extra noise. If, as is the case in practice, 
p <C 1 and g ^ 1, then ~ g^, and the variance is a factor of 2 larger than 
a constant gain would have produced. Thus (g]) is modified to 



shown to 



5- (1+P) 



N 



(7) 




l-p 



(8) 



V^R^ + 2g^C, 



(9) 



^ Do not confuse this gain with the usual "gain" of CCDs which is simply a conver- 
sion factor between electrons and recorded counts or ADU. To avoid confusion I 
only ever talk here in terms of electrons not ADU. 
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and the signal-to-noise ratio for one pixel is given by 

C 

compared to 



(10) 



(11) 



^/WTC 

for a normal CCD. The extra variance can be thought of as being equivalent 
to a 50% drop in QE. Put this way it sounds bad, but comparing the above 
two relations for signal-to-noise, and assuming that g is so large that R/g 
can be neglected, one can see that the EMCCD gains once C < B? . In other 
words once C ^ R^, normal CCDs have also effectively lost a factor 2 in QE, 
but unlike EMCCDs normal CCDs carry on getting worse as C drops. This 
analysis also shows that ([5|) marks the dividing line between normal CCDs 
versus EMCCDs running in "linear" mode with counts proportional to the 
voltage of the amplifier. 

At very low count rates, the full QE can be recovered by operating in a 
photon counting mode. In this case rather than take the output divided by the 
mean gain as an estimate of the counts, one defines a threshold T such that 
an output > r is recorded as one photon, while an output < T is recorded 
as zero. Provided that one is operating in a regime where the chance of more 
than 1 e~ /pixel/exposure is small, this adds no extra noise to the output, and 
the signal-to-noise becomes 

^ (12) 



If g is large, this implies ideal, Poisson-limited performance. Such a device 
could give us data looking like the right- rather than the left-hand side of 
Fig.El 



Problems: thresholds, non-linearity and Clock-Induced Charges 

EMCCDs suffer from some drawbacks, that apply to normal CCDs but which 
are not usually apparent above readout noise. First of all, given the variable 
gain, which can in principle be as low as 1, a finite threshold must imply some 
loss of counts and therefore QE. To know how much, one needs to know the full 
probability distribution of the gain. For a single electron input, low p, high g 
case, the output probability distribution function (PDF) for n counts is quite 
well approximated hy g~^ exp —n/g [T. Figure [6] shows a comparison between 
an exact calculation of the PDF and this simple approximation, showing it to 
be good. This immediately implies that a threshold T leads to a decrease in 
the QE by factor of exp(— T/g). One therefore wants T g to avoid too large 
a loss in QE, but at the same time T must not be so low that readout noise 
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Number of counts 

Fig. 6. The solid line shows an exact computation of the output probability dis- 
tribution function of a 536 stage electron multiplying register with p — 0.011 given 
an input of 1 electron. The dashed line shows the approximation exp{—n/g)/g from 
[T] where n is the number of counts and the mean gain g = 352 in this case. The 
approximation only deviates significantly for values which are very rare 

alone leads to spurious counts, i.e. T > 4 - 5 times R, so that for gaussian 
noise there is a very small chance of counts induced artificially by readout 
noise. 

The next problem is a common feature of photon counting devices, which 
is non-linearity at "high" count rates. In this case significant non-linearity will 
set in when the mean count rate rises above ^ 0.2 e/pixel/exposure. Thus one 
can be Hmited by the rate at which pixels can be clocked out. This is exactly 
like the IPCS except that one can elect in the case of EMCCDs to work in the 
linear mode if it is clear that the count rates are too high for reliable photon 
counting. 

The third and worst problem is a new phenomenon, or at least one that 
is only revealed by the low noise capabilities of EMCCDs: "Clock Induced 
Charges" or CICs. These are electrons which are spontaneously produced 
during clocking. Their statistics are complex and I leave a detailed discussion 
of them to the appendix where I include calculations that as far as I am aware 
have not been published before. As far as we are concerned they are equivalent 
to a readout noise and lead to a variance (in the photon counting case) of the 
form ^ 

^ " ( 5 ) ^ ^ ^' ^^^^ 
where Rc is the number of CICs/pixel/exposure, equivalent to a readout noise 

1/2 

before avalanche amplification of i?J . The C here should be interpreted as 
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including the exp(— T/t;) factor discussed above, and the Rq rate wiU similarly 
be threshold dependent in this case. The CICs are better thought of as a 
source of readout noise than background counts because they are incurred 
per exposure, not per unit time. 

The three possible operating modes of CCDs are summarised in Fig.[71 For 
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Fig. 7. The three operating modes of CCDs are shown in terms of the exposure time 
taken to reach a given signal-to-noise compared to a "perfect" detector of identical 
QE. Normal mode (solid line) is best at high count levels drops off sharply once 
readout noise dominates. Photon counting mode (dot-dashed line) is best at very 
low count levels but cannot go past 1 e~ /pixel/exposure owing to non-linearity. In 
between, linear or "proportional" mode (dashed line) is best despite the 50% QE 
drop caused by the variable avalanche gain. The rise at very low counts is caused 
by CICs, with a rate of 0.01 e/pixel/exposure assumed here 



EMCCDs the focus is often on photon counting mode and the need to clock 
the pixels out fast to retain linearity. Figure [7] shows that there remains a role 
for the linear mode, the fraction of parameter space it occupies depending 
upon the CIC rate. Note that I have extended the regime of the linear mode 
in Fig. [7] towards lower counts than might seem justified by the curves; this is 
because of the non-linearity of photon counting. In principle it can be corrected 
to some extent, and this is built into Fig. [71 however in practice non-linearity 
corrections do not work as well as one might hope and I have restricted the 
photon counting mode to rates less than 0.3 e'^/pixel/exposure. 

Clearly the CIC rate is of central importance in how well these devices 
will perform in practice, as has been recognised before [11]. Rates in the 
range 0.004 to 0.1 CICs/pixel/exposure have been quoted [ID]. One can hope 
for manufacture-driven improvements with time, and controller improvements 
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have a significant role to play too [20 , but one should not lose sight of the 
fact that already even the high rate of 0.1 CICs/pixel/exposure is equivalent 
to an extremely low readout noise and can allow the count rate C to drop 
by a factor of 100 compared to normal CCDs before the noise floor is hit; 
this is an impressive potential gain and exactly what it needed to carry out 
high-time resolution spectroscopy on faint objects. Having said that, CICs do 
negate the QE advantages of CCDs compared to alternatives, in particular 
GaAs image intensifiers |11) . 

OPTICON-funded L3CCD for spectroscopy 

Given the uncertainties over CIC rates, thresholds, pixel clocking rates and 
the like, EMCCDs are very much in the development phase for astronomy. We 
don't yet know indeed whether they are reliable for spectroscopy in the sense 
that they can return accurate atomic line profiles, equivalent widths and ra- 
dial velocities. Thus as part of the OPTICON programme of the EU's Frame- 
work Programme 6 (FP6), the Universities of Sheffield and Warwick, and the 
Astronomy Technology Centre, Edinburgh have started a programme to char- 
acterise an EMCCD for spectroscopic work. We have purchased a CCD201 
which is a frame transfer device with a Ik x Ik imaging area and with one 
normal and one avalanche readout. As of mid-2006, the device is mounted 
in a cryostat and producing test data. We will be using hardware developed 
for the high-speed camera ULTRACAM [12]. In terms of imaging area, this 
device is not competitive with existing detectors on spectrographs. On top 
of this the ULTRACAM controller will not push the device to its limits and 
so clear improvements are possible even now. However, it will allow us to see 
whether these devices are a promising route to explore for future development 
of high-speed spectroscopy. 

4 Conclusion 

High-speed optical spectroscopy offers a number of unique diagnostics of ex- 
treme astrophysical environments and will become only more necessary as 
larger surveys discover more unusual objects. Unfortunately standard CCDs, 
the workhorses of optical astronomy, are not well suited to the high-speed and 
low noise required when light is dispersed. This may be changing with recent 
developments of avalanche gain CCDs, but they must be tested on celestial 
objects before we know whether this is truly the case. This is the aim of a 
programme funded under the EU FP6's OPTICON consortium. 
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A Clock Induced Charge statistics 

In this section we detail some of the statistical properties of CICs. CICs come 
in two varieties: "pre-register" and "in-register" events. Pre-register events 
suffer the full amplification of the avalanche stage and will have identical 
statistics to electrons generated from genuine signals. In-register events are 
amplified by a variable amount depending upon where in the avalanche regis- 
ter they are first produced. This leads to an extremely skewed distribution for 
these events with lots of low values but a tail extending to very high values 
as well as shown in Fig. [H The skewed distribution makes it hard to define 
CIC rates in a general way. For example, Fig. [8] shows that for the particular 
parameters chosen about 25% of pre-register CICs would on output fall below 
a threshold of 100, while 90% of the in-register CICs will do so. Lowering 
the threshold could increase the pre-register rate by at most 25%, but could 
increase the in-register rate by up to 10 times. When quoting rates, it is im- 
portant to define how they were measured. In this case a threshold of 100 will 
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Number of counts Number of counts 

Fig. 8. In the left panel the solid line shows the distribution of in-register CICs for 
a 536 stage electron multiplying register with p — 0.011 and CIC probability/stage 
of pc = 0.0011, i.e. this is the output PDF given a zero electron input. The dashed 
line shows the PDF given an input of 1 electron with pc set to zero. The right-hand 
panel shows the equivalent cumulative distribution functions, and illustrates that 
the parameters chosen here will give a CIC above 100 counts in about 1 in 10 pixels 



lead to a rate of 0.1 CIC/pixel/exposure, at the high end of total CIC rates 
(in- plus pre-register) gO] , and so presumably the probability of a CIC being 
generated at any one stage of the avalanche register is usually less than I have 
assumed. 

If the probability of a CIC being generated at any one stage of the 
avalanche register is pc, then one can show that the mean output value, 
given zero input fic is given by 

= —(<?-!), (14) 
P 

while the variance is 

2 + p ^' + '^-iT-p)^''- ^''^ 

For p<Cl, _pc ^1,5^1, this boils down to Vc ~ gf^c- For the example 
shown in Fig. [51 /ic = 35 while ac = 110. In proportional mode, this is effec- 
tively equivalent to a readout noise component of i? = ^^c/g = 0.3 entering 
the variance equation V = + 2C (ignoring the amplifier readout noise). A 
threshold of 100 leads to a very similar number for photon counting mode. 

There may be some room for optimisation of the threshold in the presence 
of significant numbers of in-register CICs. In the example shown, a threshold 
of 100 leads to a 25% reduction in the true event rate and a 10% CIC rate, 
so that the signal-to-noise would be 

(16) 



V0.1 + 0.75P 



where P is the mean number of photo-electrons per pixel per exposure. If the 
threshold is raised to 200, this becomes 
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VO.OS + O.STP' 

which is better for P < 0.057. 

The threshold cannot be optimised for pre-register events in the same way 
since they have identical statistics to genuine events. The relative numbers 
of in-register and pre-register events appears to depend upon the controller 
[40l [20] . My suspicion, which is possibly borne out by one of these studies 
[20] , would be that the high voltages required for the avalanche register will 
make it harder to reduce the in-register compared to pre-register event rates. 



